Sound localization in a median plane using an avatar robot “TeleHead” with synchronization of a listener’s horizontal head rotation

Authors

  • Yukio Iwaya
  • Yusuke Masuyama
  • Makoto Otani
  • Yôiti Suzuki
Abstract

Demand for communication with a high sense of presence has grown. For such communication, it is acoustically important to capture comprehensive sound space information at a remote place and transmit it to a local site. The authors developed an avatar robot as a simplified version of the TeleHead proposed by Toshima et al. The robot's head moves synchronously, following the listener's horizontal head rotation. The authors investigated perceptual sound localization accuracy in the median plane at a remote site. Results show that sound localization accuracy improves when the robot rotates synchronously with the listener's head rotation. Another sound localization test was conducted to examine the effects of manipulating the ratio of the rotation angle: the ratio between the listener's actual movement and that of the robot was varied systematically. Results show that the ratio only slightly affects the accuracy of perceived elevation angles, suggesting considerable robustness in the use of cues provided by head rotation.

INTRODUCTION

In the near future, advanced and natural communication with a person in a remote place might be realized using a system with a high sense of presence. For such communication, it is important to capture comprehensive sound space information at a remote place and transmit it to a local site. Another important aspect of interactive communication is reproducing a sound space so that the synthesized sound field responds to a listener's movement. Several studies have revealed that the accuracy of sound localization in a horizontal plane improves when movement of the head and body is allowed, in both real [1, 2, 3, 8] and virtual [4, 5, 6, 7] environments. Several methods have shown great potential to realize responsiveness to listeners' movements in sound reproduction.
Wave field synthesis (WFS) [9] and boundary sound control (BoSC) [10], both based on the Kirchhoff–Helmholtz integral equation, permit a listener to change position and move the head freely within the controlled area. Furthermore, the Ambisonics technique [11] allows a listener's head movements at and near the sweet spot. However, these methods require many loudspeakers to control sound fields with high accuracy. Numerous microphones are also needed for recording.

Sakamoto proposed a novel sound capturing system, SENZI [12]. A spherical array fitted with many microphones captures three-dimensional sound space information. These signals are converted to binaural signals using simple digital signal processing. Because the spherical microphone array is symmetric, the signal processing can be changed according to head movement sensed using a position sensor.

An important point in reproducing a sound field is to synthesize it so that the listener's head-related transfer functions (HRTFs) [13] are properly convolved before sounds arrive at the eardrums. In WFS, BoSC, and Ambisonics, HRTFs are naturally convolved at the listening point by the listener's own head and body. In contrast, the SENZI system needs measurements or numerical estimations of the listener's HRTFs because HRTFs depend closely on the shapes of a listener's head, body, and ears. However, measuring HRTFs for a specific listener requires a huge measurement apparatus, time, and effort [14], and numerical estimation demands large computational resources [15, 16, 17, 18].

TeleHead [19], placed at a remote site, can move synchronously with a person's various head movements, which are sensed via a position sensor attached to the person's head. The person can listen to sounds via TeleHead as an avatar by wearing headphones whose inputs are connected to TeleHead's ears.
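As a rough illustration of the synchronization described here, and of the rotation-ratio manipulation mentioned in the abstract, the mapping from a sensed listener yaw angle to a robot yaw command can be sketched as below. This is only a minimal sketch under stated assumptions, not the authors' implementation: the function name `robot_yaw_command`, the `ratio` parameter, and the mechanical limit `limit_deg` are all hypothetical, and the source does not say how the robot is actually controlled.

```python
def robot_yaw_command(listener_yaw_deg, ratio=1.0, limit_deg=90.0):
    """Map a sensed listener yaw angle to a robot yaw command (degrees).

    Hypothetical sketch: ratio=1.0 means the robot mirrors the listener's
    horizontal rotation exactly; other values scale the robot's rotation
    relative to the listener's. limit_deg is an assumed mechanical limit.
    """
    target = ratio * listener_yaw_deg
    # Clamp to the assumed mechanical range of the robot's neck.
    return max(-limit_deg, min(limit_deg, target))


# Example: the same listener rotation under three assumed ratios.
for r in (0.5, 1.0, 2.0):
    print(r, robot_yaw_command(30.0, ratio=r))
```

With a ratio of 1.0 the robot simply follows the listener; the other ratios stand in for the manipulated conditions, whose actual values are not given in this excerpt.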
The head of TeleHead can be exchanged, so a listener can use a model of her own head as an avatar at the remote site. Because two microphones are installed at the ear entrances of the dummy head at the remote site, the listener's own HRTFs are naturally convolved when she uses her own head model as the dummy head. Although the listener must prepare such a head model in advance, TeleHead appears promising as a way to sense the whole remote sound space interactively, acting as an avatar of the listener with the listener's own HRTFs.

Several researchers have reported that head movement during listening can enhance the accuracy of sound localization and the reality, or sense of presence, of the perceived sound space in real and virtual environments [1, 2, 3, 4, 7]. Toshima et al. also investigated sound localization accuracy using TeleHead in the horizontal and median planes [20]. They used head shapes of two types and discussed the effects of head shape. They pointed out that localization accuracy can be improved using TeleHead even when the head shape of TeleHead is not the listener's own. They also reported that synchronization with a listener's head movements was important. However, in their experiment, TeleHead could

Similar resources

Upper hemisphere sound localization using head-related transfer functions in the median plane and interaural differences

Morimoto and Aokata [J. Acoust. Soc. Jpn. (E), 5, 165–173 (1984)] clarified that the same directional bands observed on the median plane by Blauert occur in any sagittal plane parallel to the median plane. Based upon this observation, they hypothesized that the spectral cues that help to determine the vertical angle of a sound image may function commonly in any sagittal plane. If this hypothesi...

Gravitoinertial force magnitude and direction influence head-centric auditory localization.

We measured the influence of gravitoinertial force (GIF) magnitude and direction on head-centric auditory localization to determine whether a true audiogravic illusion exists. In experiment 1, supine subjects adjusted computer-generated dichotic stimuli until they heard a fused sound straight ahead in the midsagittal plane of the head under a variety of GIF conditions generated in a slow-rotati...

MEG study of sound localization in the median plane

The purpose of this study is to investigate the human brain activity relating to sound localization in the median plane. Recent studies of sound localization in the horizontal plane have revealed that the right hemisphere is dominant in auditory spatial processing of sounds from different directions. In this study, the auditory stimuli were presented randomly from four different directions in the m...

3D sound image control by individualized parametric head-related transfer functions

It is well known that the listener’s own head-related transfer functions (HRTFs) provide accurate 3D sound image localization. However, the HRTFs of other listeners often cause degradation of localization accuracy. Though the 3D auditory display with a head-motion tracker, which provides a dynamic spatial cue to a listener, improves the rate of front-back confusion, this dynamic cue is not enou...

Adaptive Adjustment of the “Sweet Spot” for Head Rotation

Spatial reproduction in a conventional stereophonic audio system (e.g., stereo or 5.1 surround) works in a small area known as the “sweet spot”. If the listener changes his position, the phantom source moves in the same direction and finally collapses into the nearer loudspeaker. A play-back system that adjusts the loudspeaker signals depending only on the listener’s position in real-time was e...

Publication date: 2010